Utterance verification using prosodic information for Mandarin telephone speech keyword spotting
نویسندگان
چکیده
In this paper, the prosodic information, a very special and important feature in Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 INITIAL’s and 37 FINAL’s in Mandarin speech, and one background/silence model, are used as the basic recognition units. For utterance verification, 12 anti-subsyllable HMM’s, 175 context-dependent prosodic HMM’s, and five anti-prosodic HMM’s, are constructed. A keyword verification function combining phonetic-phase and prosodic-phase verification is investigated. Using a test set of 2400 conversational speech utterances from 20 speakers (12 males and 8 females), at 8.5% false rejection, the proposed verification method resulted in 17.8% false alarm rate. Furthermore, this method was able to correctly reject 90.4% of nonkeywords. Comparison with a baseline system without prosodic-phase verification shows that the prosodic information can benefit the verification performance.
منابع مشابه
Utterance Verification Using Prosodic Information for Mandarin Telephone Speech Keyword Spotting - Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference o
In this paper, the prosodic information, a very special and important feature in Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 m s and 37 FINAL’S in Mandarin speech, and one backgroundkilence model, are used as the b...
متن کاملIntegration of phonetic and prosodic information for robust utterance verification - Vision, Image and Signal Processing, IEE Proceedings-
Mandarin speech is known for its tonal charactcristic, and prosodic information plays an important role in Mandarin speech recognition. Driven by this propcrty, phonetic and prosodic information are integrated and used for Mandarin telephone speech keyword spotting. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 132 subsyllable models, two ...
متن کاملTelephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verification
In this paper a fuzzy search algorithm is proposed to deal with the recognition error for telephone speech. Since the prosodic information is a very special and important feature for Mandarin speech, we integrate the prosodic information into keyword verification. For multi-keyword detection, we define a keyword relation and a weighting function for reasonable keyword combinations. In the keywo...
متن کاملDistributed Chinese Keyw Verification for Spoken Wireless Envir
With the rapid developments of wireless communications, it is highly desired for users to access the network information with spoken dialogue interface via hand-held devices at any time, from anywhere. One possible approach towards this goal is to perform speech feature extraction at the hand-held devices (the clients) and have all other recognition tasks and dialogue functions absorbed by the ...
متن کاملConfidence Measure for Utterance Verification in Keyword Spotting System
In this article, we propose an utterance verification technique for keyword spotting. The keyword spotting system analyzes a given spoken content and searches every speech segment in which one of pre-defined keywords is uttered. To maintain a stable recognition performance in the system, we propose an utterance verification technique that verifies whether a found utterance, or a candidate keywo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999